BBN System Description for WMT10 System Combination Task

نویسندگان

  • Antti-Veikko I. Rosti
  • Bing Zhang
  • Spyridon Matsoukas
  • Richard M. Schwartz
چکیده

BBN submitted system combination outputs for Czech-English, German-English, Spanish-English, French-English, and AllEnglish language pairs. All combinations were based on confusion network decoding. An incremental hypothesis alignment algorithm with flexible matching was used to build the networks. The bi-gram decoding weights for the single source language translations were tuned directly to maximize the BLEU score of the decoding output. Approximate expected BLEU was used as the objective function in gradient based optimization of the combination weights for a 44 system multi-source language combination (All-English). The system combination gained around 0.42.0 BLEU points over the best individual systems on the single source conditions. On the multi-source condition, the system combination gained 6.6 BLEU points.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Hypothesis Alignment with Flexible Matching for Building Confusion Networks: BBN System Description for WMT09 System Combination Task

This paper describes the incremental hypothesis alignment algorithm used in the BBN submissions to the WMT09 system combination task. The alignment algorithm used a sentence specific alignment order, flexible matching, and new shift heuristics. These refinements yield more compact confusion networks compared to using the pair-wise or incremental TER alignment algorithms. This should reduce the ...

متن کامل

CUED Submission for the WMT10 Translation Shared Task

This paper describes the Cambridge University Engineering Department (CUED) system for the ACL 2010 fifth workshop on statistical machine translation (WMT10). We participated in the FrenchEnglish and Spanish-English translation shared tasks in both directions. The CUED system is a hierarchical phrase-based system that uses finite-state transducers and lattice rescoring. In the French-English ta...

متن کامل

Expected BLEU Training for Graphs: BBN System Description for WMT11 System Combination Task

BBN submitted system combination outputs for Czech-English, German-English, SpanishEnglish, and French-English language pairs. All combinations were based on confusion network decoding. The confusion networks were built using incremental hypothesis alignment algorithm with flexible matching. A novel bi-gram count feature, which can penalize bi-grams not present in the input hypotheses correspon...

متن کامل

Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

This paper presents the results of the WMT10 and MetricsMATR10 shared tasks,1 which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 104 machine translation systems and 41 system combination entries. We used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of trans...

متن کامل

Machine Translation System Combination by Confusion Forest

The state-of-the-art system combination method for machine translation (MT) is based on confusion networks constructed by aligning hypotheses with regard to word similarities. We introduce a novel system combination framework in which hypotheses are encoded as a confusion forest, a packed forest representing alternative trees. The forest is generated using syntactic consensus among parsed hypot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010